NSF PAR Search | NSF Public Access Repository


Automating multi-task learning on optical neural networks with weight sharing and physical rotation

https://doi.org/10.1038/s41598-025-97262-2

Zhou, Shanglin; Li, Yingjie; Gao, Weilu; Yu, Cunxi; Ding, Caiwen (April 2025, Scientific Reports)

Full Text Available

Secure and Efficient Video Inferences with Compressed 3-Dimensional Deep Neural Networks

https://doi.org/10.1145/3714393.3726505

Liu, Bingyu; Arastehfard, Ali; Wang, Rujia; Liu, Weiran; Ba, Zhongjie; Zhou, Shanglin; Hong, Yuan (June 2025, ACM)

Full Text Available

A Multi-Agent Reinforcement Learning Approach for Safe and Efficient Behavior Planning of Connected Autonomous Vehicles

https://doi.org/10.1109/TITS.2023.3336670

Han, Songyang; Zhou, Shanglin; Wang, Jiangwei; Pepin, Lynn; Ding, Caiwen; Fu, Jie; Miao, Fei (December 2023, IEEE Transactions on Intelligent Transportation Systems)

Full Text Available

Surrogate Lagrangian Relaxation: A Path to Retrain-Free Deep Neural Network Pruning

https://doi.org/10.1145/3624476

Zhou, Shanglin; Bragin, Mikhail A.; Gurevin, Deniz; Pepin, Lynn; Miao, Fei; Ding, Caiwen (November 2023, ACM Transactions on Design Automation of Electronic Systems)

Network pruning is a widely used technique to reduce computation cost and model size for deep neural networks. However, the typical three-stage pipeline (i.e., training, pruning, and retraining (fine-tuning)) significantly increases the overall training time. In this article, we develop a systematic weight-pruning optimization approach based on surrogate Lagrangian relaxation (SLR), which is tailored to overcome difficulties caused by the discrete nature of the weight-pruning problem. We further prove that our method ensures fast convergence of the model compression problem, and the convergence of the SLR is accelerated by using quadratic penalties. Model parameters obtained by SLR during the training phase are much closer to their optimal values than those obtained by other state-of-the-art methods. We evaluate our method on image classification tasks using CIFAR-10 and ImageNet with state-of-the-art multi-layer perceptron based networks such as MLP-Mixer; attention-based networks such as Swin Transformer; and convolutional neural network based models such as VGG-16, ResNet-18, ResNet-50, ResNet-110, and MobileNetV2. We also evaluate object detection and segmentation tasks on COCO, the KITTI benchmark, and the TuSimple lane detection dataset using a variety of models. Experimental results demonstrate that our SLR-based weight-pruning optimization approach achieves a higher compression rate than state-of-the-art methods under the same accuracy requirement and can also achieve higher accuracy under the same compression rate requirement. Under classification tasks, our SLR approach converges to the desired accuracy × faster on both of the datasets. Under object detection and segmentation tasks, SLR also converges 2× faster to the desired accuracy. Further, our SLR achieves high model accuracy even at the hard-pruning stage without retraining, which reduces the traditional three-stage pruning pipeline to a two-stage process. Given a limited budget of retraining epochs, our approach quickly recovers the model’s accuracy.
Full Text Available

Neural population clocks: Encoding time in dynamic patterns of neural activity.

https://doi.org/10.1037/bne0000515

Zhou, Shanglin; Buonomano, Dean V. (April 2022, Behavioral Neuroscience)

Full Text Available

Poster: Cryptographic Inferences for Video Deep Neural Networks

https://doi.org/10.1145/3548606.3563543

Liu, Bingyu; Wang, Rujia; Ba, Zhongjie; Zhou, Shanglin; Ding, Caiwen; Hong, Yuan (November 2022, Proceedings of the 2022 ACM SIGSAC Conference on Computer and Communications Security (CCS))

Full Text Available

PASNet: Polynomial Architecture Search Framework for Two-party Computation-based Secure Neural Network Deployment

https://doi.org/10.1109/DAC56929.2023.10247663

Peng, Hongwu; Zhou, Shanglin; Luo, Yukui; Xu, Nuo; Duan, Shijin; Ran, Ran; Zhao, Jiahui; Wang, Chenghong; Geng, Tong; Wen, Wujie; et al. (July 2023, 2023 60th ACM/IEEE Design Automation Conference (DAC))

Full Text Available

EVE: Environmental Adaptive Neural Network Models for Low-power Energy Harvesting System

Islam, Sahidul; Zhou, Shanglin; Ran, Ran; Jin, Yu-Fang; Wen, Wujie; Ding, Caiwen; Xie, Mimi (November 2022, IEEE International Conference on Computer-Aided Design (ICCAD))

Full Text Available

EVE: Environmental Adaptive Neural Network Models for Low-Power Energy Harvesting System

https://doi.org/10.1145/3508352.3549451

Islam, Sahidul; Zhou, Shanglin; Ran, Ran; Jin, Yu-Fang; Wen, Wujie; Ding, Caiwen; Xie, Mimi (October 2022, ACM)

Full Text Available

Encoding time in neural dynamic regimes with distinct computational tradeoffs

https://doi.org/10.1371/journal.pcbi.1009271

Zhou, Shanglin; Masmanidis, Sotiris C.; Buonomano, Dean V. (March 2022, PLOS Computational Biology)
Gutkin, Boris S. (Ed.)
Converging evidence suggests the brain encodes time in dynamic patterns of neural activity, including neural sequences, ramping activity, and complex dynamics. Most temporal tasks, however, require more than just encoding time, and can have distinct computational requirements including the need to exhibit temporal scaling, generalize to novel contexts, or remain robust to noise. It is not known how neural circuits can encode time and satisfy distinct computational requirements, nor is it known whether similar patterns of neural activity at the population level can exhibit dramatically different computational or generalization properties. To begin to answer these questions, we trained RNNs on two timing tasks based on behavioral studies. The tasks had different input structures but required producing identically timed output patterns. Using a novel framework, we quantified whether RNNs encoded two intervals using one of three different timing strategies: scaling, absolute, or stimulus-specific dynamics. We found that similar neural dynamic patterns at the level of single intervals could exhibit fundamentally different properties, including generalization, the connectivity structure of the trained networks, and the contribution of excitatory and inhibitory neurons. Critically, depending on the task structure, RNNs were better suited for generalization or robustness to noise. Further analysis revealed different connection patterns underlying the different regimes. Our results predict that apparently similar neural dynamic patterns at the population level (e.g., neural sequences) can exhibit fundamentally different computational properties with regard to their ability to generalize to novel stimuli and their robustness to noise, and that these differences are associated with differences in network connectivity and distinct contributions of excitatory and inhibitory neurons. We also predict that the task structure used in different experimental studies accounts for some of the experimentally observed variability in how networks encode time.
Full Text Available
